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Abstract 


In  this  paper  we  consider  the  selection  and  ranking  problem  in  a  nonpara¬ 
metric  setup  when  the  populations  111,112, are  characterized  by  func¬ 
tionals  of  the  associated  distribution  functions  ${Fi),0{F2), . . .  ,0(Fk),  where 
0{Fi)  =  / gidFi,  for  *  =  and  gi,92,...,gk  are  known  bounded  func¬ 

tions.  The  problems  of  selecting  the  best  population  under  the  indifference 
zone  approach  and  the  subset  selection  approach  are  considered.  Approximate 
non-randomized  rules  are  obtained.  Finally,  some  simulation  studies  concerning 
these  procedures  are  given. 
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1  Introduction 


In  many  practical  situations,  the  experimenter  often  faces  the  problem  of  comparing 
several  competing  populations,  treatments  in  clinical  trials  or  processes.  The  selection 
and  ranking  methodolo^Ljof  ranking  and  selection  provides  the  useful  techniques  for 
solving  such  problems.  There  have  been  two  main  approaches  to  selection  and  ranking 
problems,  the  indifference  zone  approach  -due-td-Bechhofer  (19545  and  the  subset 
selection  approach -due  to  Gupta  (195€f5L  In  the  indifference  zone  approach  a  single 
popiilation  is  chosen  and  is  guaranteed  to  be  the  best  (worst)  with  probability  at  least 
equal  to  P*.  However,  in  this  formulation  it  is  assumed  that  the  best  population 
is  sufficiently  apart  from  the  remaining  k  —  1  populations.  In  the  subset  selection 
approach  no  such  restriction  on  the  parameter  space  is  asstuned.  A  random  size 
subset  of  k  populations  is  chosen  which  is  guaranteed  to  contain  the  best  (worst) 
population  with  probability  at  least  equal  to  P*.  In  this  approach  the  data  or  the 
outcome  of  the  experiment  is  used  to  decide  on  how  many  populations  to  selectT^For 
an  extensive  review  of  these  formulations  see  Gupta  and  Panchpakesan  (1979)  ^nd 
Gupta  and  Panchpakesan  (1986).  ' t  ir  c  <’ ' 

* /••  M  f  <' ■  ^ 

s. 

Often  in  practice,  especially  for  the  new  treatments,  or  for  expensive  products 
there  is  not  much  information  (the  past  data)  which  could  lead  us  to  assume  a  para¬ 
metric  model.  In  this  paper  we  consider  a  ranking  and  selection  problem  in  a  non- 
parametric  setup.  Considerable  amount  of  work  has  been  done  on  the  problems  of 
selecting  population  associated  with  the  largest  a-th  quantile  (or  the  largest  location 
parameter)  or  selecting  a  subset  of  the  populations  which  contains  the  population  as- 
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sociated  with  the  largest  a-th  quantile  (or  location  parameter).  Some  references  are 
Barlow  and  Gupta  (1969),  Gupta  and  McDonald  (1970),  Gupta  and  Huang  (1974), 
Rizvi  and  Sobel  (1967),  and  Sobel  (1967).  An  extensive  review  of  non-parametric 
selection  and  ranking  procedures  is  in  Desu  and  Bristol  (1986). 

To  formulate  the  problem,  let  ni,n2, •••,11*  be  the  k  independent  populations. 
The  population  H,-  is  associated  with  the  cumulative  distribution  function  F,(.)  on  W’, 
for  t  =  1,2, . . . ,  A:.  The  population  II, •  is  characterized  by  the  real- valued  functional, 

H^i)  =  /  gi{x)dFi{x)  ; 

JRf 

where  g,  is  a  known,  real-valued  bounded  function  on  In  this  paper  we  obtain 
the  “optimal”  classical  type  procedures.  Non-randomized  procedures  are  proposed. 
It  is  also  shown  that  the  proposed  non-randomized  selection  procedures  are  “close” 
to  the  optimal  procedures.  A  lower  boimd  for  the  probability  of  a  correct  selection 
is  also  obtained.  The  non-parametric  procedures  which  are  developed  in  this  paper 
are  robust  and  may  also  be  used  to  do  the  preliminary  analysis.  We  believe  these 
procedures  would  be  of  xise  in  many  selection  and  ranking  problem  where  the  dis¬ 
tribution  functions  associated  with  the  populations  do  not  possess  “nice”  properties. 
Some  Monte  Carlo  results  axe  presented  in  the  Section  4. 

2  Indifference  Zone  Approach 

In  this  section  we  consider  the  problem  of  selecting  the  best  (worst)  population  under 
the  indifference  zone  approach.  The  goal  is  to  select  the  best  population  with  proba¬ 
bility  at  least  P*,  provided  that  the  “distance”  between  the  best  population  and  the 
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remaining  ^  —  1  populations  is  at  least  d,  where  d  is  some  positive  number  specified 
by  the  experimenter. 

As  defined  before,  let  Hi, IIj, . . . , Ilfc  be  the  k  populations.  First  we  consider  the 
problem  of  selecting  the  best  population  among  k  population  when  the  population  11,- 
is  characterized  by  the  functional  0(Fi)  =  /  gdFi,  for  *  =  1,2,  ...,k  and  we  are  inter¬ 
ested  in  selecting  large  (small)  values  of  6.  If  necessary,  we  make  the  transformation 

9  -  inf  g 

^  supflf-inf^’ 

and,  without  any  loss  of  generality  assume  that  sup  ^(z)  =  1  and  inf  </(x)  =  0.  Let 
^[1]  %-i]  ^  %]  ordered  values  of  Oi,02,. . .  ,Bk.  The  correct 

pairing  between  ordered  and  unordered  ^‘s  is  completely  unknown.  The  population 
corresponding  to  0[fc]  is  called  the  best  population,  in  case  of  ties  we  assume  that  one 
of  them  is  tagged  to  be  the  best  population.  Our  goal  is  to  select  the  best  population 
with  probability  of  correct  selection  at  least  P*.  We  need  to  define  some  notations. 

Let 

F  =  {{Fi, F2,..., Fk) :  Fi  is  distribution  on  BF  }  . 

In  general,  if  we  allow  F  to  take  any  value  in  F  then  there  does  not  exist  a  procedure 
which  would  satisfy  the  P*  condition,  hence  we  need  to  restrict  the  space.  Let  d  be 
a  real  number  in  the  interval  (0,1)  and  define,  following  Bechhofer  (1954), 

0'  =  {(^1,  ^2,  •  •  • ,  ^fe)  :  0[k]  -  %-i]  > 

and 

F={F:  9{F)  €  0'}  . 
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Correct  selection  (CS)  :  Selecting  the  best  population 

Goal:  For  given  P*  (1/A:  <  P*  <  1),  find  a  procedure  R  such  that  for  any  n; 

PFiCS\R,n)>P^  for  every  P  e  ,  (1) 

where  Pf(CS\R,  n)  denotes  the  probability  of  a  correct  selection  for  the  procedure 
R.  The  above  condition  is  called  the  P*— condition. 

In  dealing  with  the  above  problem,  we  need  to  introduce  some  notations.  Let 
Pi  =  (Pii,Pi2,---,p.n);  p  =  where  p,j  >  0  for  i  =  1,2,..., A;  and 

for  j  =  1,2, ...,n.  Let  be  the  independent  Bernoulli  random  variables  with 
parameters  p,y  for  i  =  1, 2, . . . ,  A:  and  j  =  1, 2, . . . ,  n,  and  let  ,  f/2,  •  •  • ,  be  the  k 
independent  uniform  random  variables  on  interval  (0, 1/2).  Let 

s, = E  z« + i/i. 

)■=! 

Define 

Mp)  =  Sj).  (2) 

Now  let  Xii,Xi2, ,  Xin  be  the  observable  independent  random  vectors  from  the 
population  Hj,  for  i  =  1, 2, ...  A:.  Let  X  =  (Xii,Xi2, . Xkn),  and  let 

j(J0  =  . s,(X,„)). 

Now  we  propose  the  following  selection  procedures. 

Procedure  Pi: 

Select  one  of  the  populations  IIi,  n^, . . . ,  11*  with  probabilities  i/fi(g(X)),  V’2(p(^))5  •  •  • ,  '•Pkigi^ 
respectively. 
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A  natural  non-randomized  version  of  this  procedure  is: 

Procedure  R2' 

Select  the  population  H,-  for  which 

randomize  in  case  of  ties. 

Notice  that  the  procedure  R\  is  randomized  procedure  and  the  procedure  R2  is  a 
non-randomized  (randomization  for  ties  considered)  version  of  procedure  R\. 

First  we  prove  that  the  decision  rule  S{X)  =  (t^i(g(X)),  t^2(g(X)), . . . ,  i^k(g(X))) 
is  “optimum”  decision  rule  for  selecting  the  best  population  among  k  populations. 


Theorem  2.1  ; 

The  procedure  Ri  maximizes  the  infimum  of  the  probability  of  a  correct  selection,  i.e. 
If  R'  is  any  other  selection  procedure  then 


M^Pf(,CS\R)  <  ^Pp{CS\R,). 


Proof: 

Observe  that  inf^(x)  =  0  and  sup5(x)  =  1. 

Fix  e  >  0,  and  get  a  and  b  such  that  y(a)  =  ci  ,  ^(6)  =  1  —  C2  and  0  <  €1  -t-  €2  <  c- 
Let  Pi  be  the  counting  probability  measure  induced  by  a  distribution  function  F,. 
Define 


0  S  P«  <  1;  for  t  =  1 , 2 . . . ,  A: 


nF'. 


(3) 
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For  i  =  1,2,  ...k  ,  define 


Ai  =  {Xij  :  Xij  =  b  j  =  1,2,..., n.)  and  Ti(X)  =  (i4,|. 

Note  that  for  a  class  of  distribution  functions  the  statistics  T  =  (7i 
is  a  complete  sufficient  statistic. 

We  also  note  that  Ti,T2, . . .  ,Tk  are  independent  and  they  have  binomial  distri¬ 
butions  with  parameters  in,pi),{n,p2),...,{n,pk),  respectively.  Since  the  binomial 
distribution  has  the  monotone  likelyhood  ratio  property,  it  is  easy  to  see  that  for 
every  invariant  prior,  a  rule  which  selects  populations  with  largest  T,-  (randomize  in 
case  of  ties)  is  a  Bayes  rule  for  0-1  valued  loss  function.  Also  notice  that  the  risk 
function  of  the  procedure  Ri  is  same  as  the  risk  function  of  the  Bayes  rule. 

Hence 

inf  PF(CS|ir)<  inf  Pf(C5|fi,). 

Since  c  is  arbitrary,  letting  e  — ^  0  the  result  follows. 

Remark  2.1  : 

From  this  theorem  we  see  that  the  procedure  Ri  is  the  “most  economical”  in  the 
sense  that  for  a  given  P*  and  d  there  doesn’t  exist  amy  other  procedure  which  can 
meet  the  basic  probability  requirement  with  a  smaller  sample.  This  was  also  proved 
in  a  special  case  by  Hall  (1958) 

Theorem  2.2  ; 

flj  PFiCS\Ri,n)  is  increasing  in  n  and 

[2]  PF{CS\Rx,n)  is  increasing  function  ofO^k]  provided  0[x],  0[2],.  •  held  fixed. 

[3]  inf^e^  PF{CS\Ri,n)  — ►  1  <wn  — ►  oo. 
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Proof: 


It  is  straightforward  to  see  that 

Pf{CS\Rx,  n)  =  P(n«  +  =  max  (^n  +  Uj)),  (4) 

where  Fi„,  Vim  •  •  •  >  Hn  are  independent  binomial  random  variables  with  parameters 
(n,  ^[1]),  (n,  ^[2]), . . . ,  (n,  ^[fc])  respectively,  and  U\,U2,  • . .  tUk  are  independent  uniform 
random  variables  over  the  interval  (0,1/2).  If  we  consider  the  problem  of  selecting 
the  best  population  among  k  binomial  populations,  the  procedure  which  selects  the 
population  Ilj  for  which  V^.n  +  t/,  =  maxj  Vj,„  +  Uj  is  the  best  invariant  and  is 
a  Bayes  procedure  with  respect  to  every  invariant  prior  on  0',  provided  that  the 
underlying  loss  function  is  permutation  invariant,  “monotone”  (more  loss  for  selecting 
bad  population)  and  nonnegative.  Hence  the  Bayes  risk  of  the  procedure  Ri  decreases 
as  n  increases  for  every  permutation  invariant  prior  on  ©'.  Thus  PF{CS\Ri,n)  is 
increasing  in  n.  From  equation  (2)  it  is  clear  that  pF{CS\R\,n)  is  an  increasing 
function  of 

The  third  result  is  an  immediate  consequence  of  the  strong  law  of  large  numbers. 
This  completes  the  proof  of  the  theorem. 

The  above  theorem  insures  that  for  a  given  P*,  there  exists  no{P*,  k,d)  such  that 

PF(C^<S'|i?i,n)  >  P*  for  every  F  G  F'. 

The  procedure  R\  has  nice  properties,  however  it  is  a  randomized  procedure. 
In  practice  the  experimenter  would  like  to  use  a  non-randomized  procedure.  The 
procedure  R2  is  a  non-randomized  version  of  R\.  The  following  theorem  gives  the 
relationship  between  Pf{CS\R\)  and  Pf{CS\R2)- 
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Theorem  2.3  : 


For  every  F  Q.  F’ 


Pf{,CS\R^)  >  2  Pf{CS\Ri)  -  1. 


Proof:  Let  Hi  be  the  best  population,  and  /  be  a  indicator  function  then 


(5) 


P{CS\R2) 


smaxi  A(b(x))) 


>  j  rl>i(g{x))  -  m&xrlPi{g{x))dF 

=  J  M9ix))dF  -  j  r^rj}i{g{x))dF 

=  PFiCS\Ri)  -  [  m&xij}i{g{x))dF 

J 

>  Pf{CS\Ri)-  fj2MKx))dF 

=  Pf{CS\Ri)  -  /(I  -  MHx)))dF 


=  Pf{CS\Ri)  -  1  +  Pf  (C5|/2x) 
=  2  PF(C5|i2x)  -  1. 


This  proves  the  theorem. 


Remark  2.2  : 

From  Theorems  2.2  and  2.3  it  follows  that,  for  every  P  € 


PF{CS\R2jn)  — ¥  1  as  n  — y  oo. 

Remark  2.3  : 

Observing  the  method  of  the  proof  of  the  above  theorem,  we  note  that,  the  above 
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result  holds  for  any  multiple  decision  problem  with  0-1  loss,  Ri  is  any  procedure  and 
the  procedure  R2  is  a  “non-randomized  version”  of  the  procedure  Ri. 

As  we  noticed  before  we  can  generalize  the  procedures  Ri  and  R2  to  obtain  the 
procedure  for  selecting  the  best  population  with  highest  parameter,  when  the  popu¬ 
lation  n,-  is  characterized  by  the  functional 

e(Fi)  =  J  gidFi-, 

where  gi  is  a  known  real- valued  function  with  infx^i(x)  =  0  and  supj,^,(x)  =  1,  for 
i  =  1, 2, . . . ,  fc.  This  can  be  done  in  the  following  way. 

Define 

f  (®)  =  <71(^12),  •  .  •  ,  ^2(X2l), .  .  .  ,  ^(X2n), .......  9k{Xkl), .  .  .  ,  9k{Xkn))- 

Let  V’lj  V’2)  •  •  • » V’fe  as  by  equation  (2). 

Procedure  R3  : 

Select  one  of  the  populations  Hi,  112,  •  •  •  >  D*  with  probabilities  ^i{g{x)),  V’2(5(x)), . . .  V’fc(^(x)), 
respectively. 

A  non-randomized  version  of  this  procedure  is  given  by 
Procedure  R4  : 

Select  the  population  IT,-  for  which 

V’i(l(x))  =  i^^V>j(l(x)) 

and  randomize  in  case  of  ties. 

Theorem  2.1  ,  Theorem  2.2  ,  Theorem  2.3  and  the  above  remarks  hold  true  for  these 
procedures  also. 
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Theorem  2.3  indicates  that  the  procedure  R2  {R4)  is  a  “good”  approximate  non- 
randomized  version  of  procedure  Ri  {R3),  whenever  P*  is  large,  and  that  is  the  case  in 
general.  For  example,  if  Pf(CS\Ri)  >  0.99  then  Pf(CS\R2)  >  0.98.  The  procedure 
is  good,  in  the  sense  that  we  lose  at  most  1—P*  due  to  ncn-randomization.  We  also 
note  that  these  procedures  can  be  generalized  to  the  problem  of  selecting  the  t  best 
populations. 

As  given  by  equation  (4)  the  probability  of  a  correct  selection  can  be  written 
in  terms  of  the  binomial  probabilities.  The  sample  sizes,  no{P*,k,d)  (exact  and 
approximate)  are  tabulated  by  Sobel  and  Huyett  (1957)  for  k  =  2,3,4,10,  d  = 
0.05(0.05)0.5  and  P*  =  0.5,0.6,0.75,0.90,0.95,0.99.  For  k  =  2  they  conjectured  that 
the  least  favorable  configuration  occurs  at  6[2]  =  (1  +  <0/2  and  0[i]  =  (1  —  <0/2.  This 
conjecture  is  shown  to  be  true  by  Eaton  and  Gleser  (1989). 

3  Subset  Selection  Approach 

In  the  subset  selection  approach  we  select  a  random  size  subset  of  the  k  populations 
which  contains  the  best  population  with  probability  P*  (1/A  <  P*  <  1).  The  main 
feature  of  selecting  a  subset  of  random  size  is  to  allow  the  size  to  be  determined  by  the 
observations  themselves.  Also  in  the  subset  selection  approach  we  need  not  assume 
any  restriction  on  the  “parameter  space”. 

Now  we  describe  the  problem  formally,  let  us  assume  that  there  are  k  populations 
III,  n2, . . . ,  Ilk.  The  random  variable  associated  with  population  ITj  has  the  cumula¬ 
tive  distribution  function  F,(.)  on  R’’.  Again  the  characterizing  function  is  real-valued 
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as  defined  earlier.  Let  0[i]  <  ^[2], . . . ,  <  ^[fc]  be  the  ordered  values  of  ^1,  ^2)  •  •  • » ^k-  The 
population  associated  with  d[k\  is  called  the  best  population,  in  case  of  ties  one  of 
them  is  tagged  as  the  best  population.  Our  goal  is  to  select  a  non  empty  subset  of 
these  k  populations  so  that  the  selected  subset  includes  the  population  associated 
with  with  large  probability.  Let  CS  denote  the  event  of  correct  selection  and 
P(CS\R)  denote  the  probability  of  correct  selection  for  the  procedure  R. 

CS:  Selecting  a  subset  of  k  populations  which  contains  the  best  population. 

Goal:  Find  a  subsect  selection  procedure  R  for  which 

P{CS\R)  >  P\ 

Let  the  decision  space  T>  consists  of  2*  —  1  subsets  of  the  set  {1, 2, . . . ,  ik}  we  write 
this  formally  as 

U  =  {a  :  a  C  {1,2, . . . ,  A:}and  la|>l}. 

Action  a  =  •  •  •  ,tr}  €  V  corresponds  to  the  selection  of  the  populations 

n,, ,  n,j, . . . ,  n,v.  a  decision  “a”  is  called  a  correct  selection  (CS)  if  the  best  pop¬ 
ulation  is  included  in  the  selected  subset.  We  implement  the  procedures  established 
by  Gupta  and  Sobel  (1960)  for  selecting  a  subset  of  k  binomial  populations  con¬ 
taining  the  best  population.  To  define  the  procedures  we  need  some  notation.  Let 
Pi  =  {Pii,Pi2,---,Pin),  0  <  pij  <  1  for  i  =  1,2, ...,A:  and  for  j  =  1,2, ...,n.  Let 
P=^  (Pl,P2,--.,Pfc). 

For  every  a  ^T>  define 

V’o(p)  =  PiminSi  >  max  Sj  —  d>  max  5,),  (6) 
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where  Si  =  IZ”_i  Zij  for  t  =  1, 2, . . . ,  fc.  For  *  =  1,2,,..,/:  and  for  j  =  1, 2, . . . ,  n  Zij 
are  independent  Bernoulli  random  variables  with  P{Zij  =  1)  =  pij.  Let  Xu ,  X,-2, . .  • ,  Xi„ 
be  the  observable  random  vectors  form  population  !!,•  for  *  =  1, 2, , , . ,  A:.  Let 

=  {9{xn),  9{xi2),  . . . ,  •  •  • ,  9ixki),  9ixk2),  •  •  • ,  <7(xfen)). 

Procedure  R,  : 

Having  observed  X  =  x,  select  a  subset  of  populations  H,-, ,  11^,, . . . ,  Hr  with  prob¬ 
ability  V»a(^(x)),  where  a  =  {*i,  *2, , . . ,  *r}. 

Theorem  3.1 


inf  Pf{CS\R  =  inf  PsiYx  >  max  Yi  -  d) 

FqF  '  '  '  0<fl<l  '  l<i<k  ’ 


(7) 


where  Yi,Y2,...,Yk  are  i.  »,  d.  binomial  random  variables  with  parameter  (n, 6). 


Proof: 

Let  Hi  be  the  best  population  then 


=  Ef'E  MX) 

a:l€a 

=  EfP{S\  >  m^S",  —  d\X  =  x), 

where  Si  =  127=1  given  X  =  x  ,  Zij  ‘s  are  independent  Bernoulli  ran¬ 

dom  variables  with  P(,Zij  =  llA"  =  x)  =  g{xij).  Hence  marginally  5i,  ^2, , . . ,  5*  are 
independent  binomial  random  variables  with  parameters  (n, ^i), (n,/?2))--->(»i»^*)> 
respectively. 

Hence  we  have 


inf  PHC'5|i?.)  =  inf 
F€^  '  '  ^  i<e.< 


P(Si  >  max  Sj 

i<j<k 


-d). 
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From  Gupta  and  Sobel  (1960)  we  know  that 


inf  P(Si  >  max  Sj  —  d)  =  inf  PiSi  >  max  5,  —  d). 

This  completes  the  proof  of  the  theorem. 

In  the  case  of  A:  =  2,  Gupta  and  Sobel  (1960)  proved  that 

inf  PiSi  >  max  Sj  —  d) 

fli=«j=...=9*  —  1<J<* 

is  attained  at  6i  =  62  =  1/2.  For  k  >  2,  the  common  value  60  at  which  infimum 
takes  place  is  not  known.  The  conservative  values  of  d  based  on  the  normal  approx¬ 
imation  have  been  tabulated  by  Gupta  and  Sobel  (1960)  for  k  =  2(1)20(5)50,  n  = 
1(1)20(5)50(10)100(25)200(50)500  and  P*  =  0.75,0.90,0.95,0.99.  Gupta,  Huang  and 
Huang  (1976)  obtained  consurvative  values  of  d  when  P*  —  0.75,0.90,0.95,0.99  and 
n  =  1(1)4  when  k  =  3(1)15,  and  n  s=  5(1)10  when  k  =  3(1)5. 

The  procedure  R,  is  randomized,  the  non-randomized  version  of  this  procedure  is 
given  by 
Procedure  : 

After  observing  X  =  x  select  a  subset  of  populations  Hj, ,  n,j, . . . ,  H,  if 

V’«(^(a;))  =  r^xMgix)), 

where  a  =  {*i,  *2?  •  •  •  j *r}t  randomize  in  case  of  ties. 

As  in  the  previous  section  we  can  generalize  these  subset  selection  procedures  when 
population  H,-  is  char2M:terized  by  the  functional  0{Fi)  =  f  gtdPi,  for  i  =  1,2,  ...fc. 

As  in  the  indifference  zone  approach  case,  we  are  not  been  able  to  get  lower  bound 
for  the  probability  of  correct  selection  of  procedure  i?'.  We  feel  however  that,  there 
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exists  a  constant  c  =  c(n)  and  a  non  trivial  subset  of  T  such  that 


Pf{CS\K)  >  PFiCS\Rs)  -  c(l  -  Pf{CS\R,)), 

VF  €  Fo. 

4  Examples 

Let  us  suppose  that  there  are  k  populations,  Hi,  112, .  •  • ,  Ilfc,  associated  with  distri¬ 
butions  functions,  Ft,  F2, . . . ,  Fk,  respectively.  In  this  section  we  will  present  some 
examples  and  the  Monte  Carlo  results.  Standard  errors  for  all  the  estimates  is  less 
than  0.035  . 

Example  4.1  : 

Let 

fi{x)  =  for  i  = 

where  /,•  is  the  density  associated  Mrith  F,-  for  t  =  1,2,  ...,k.  We  want  to  select 
the  population  associated  with  Take  g  as  c.d.f.  of  double  exponential  random 
variable  with  parameter  fi.  The  problem  of  selecting  the  population  with  the  highest 
location  parameter  is  same  as  the  problem  of  selecting  the  population  with  the  highest 
functional.  Now  we  will  use  the  nonparametric  procedures  and  make  comparisons. 
Let 

Ri".  The  nonparametric  rule. 

R2  :  Non  randomized  version  of  the  rule  Ri. 
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Rmedian  ■  Selects  the  population  associated  with  the  highest  median. 
Rmean  •  Selects  the  population  associated  with  the  highest  mean. 


Ms  =  1, 

n  =  13, 

P{CS) 

Ax  =  2 

/X  =  0.75 

/x  =  0, 

Ri 

0.4823 

0.6815 

0.665 

R2 

0.690 

0.882 

0.880 

Rmedian 

0.887 

0.887 

0.887 

In  practice  we  may  not  know  the  configuration;  then  we  estimate  and 
by  sample  medians  and  set  /x  =  estimate  of  +  3/4(^[jk)  —  /X[fc_i)).  We  will  take 
n  =  23 


P(CS\Ei)  =  0.69 
P{CS\R2)  =  0.89 
PiCS\Rm.dian)  =  0.93 


Example  4.2  : 


Let 


Fiix)  = 


1 


1  +  e-{x-fii) 


,  for  i  =  1,2, . . . ,  fc. 


=  /X2  =  A*3  =  ^<4  =  0,  /Xs  =  1 

g  —  c.d.f  of  logistic,  /X  =  0.75 


PiCS\Ri)  =  0.550 
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P{CS\R2)  =  0.806 
P{CS\R^^ian)  =  0.748 
PiCS\Rmean)  =  0.803 

As  in  the  previous  example  we  will  estimate  and  and  by  sample  medians 
and  set  fi  =  estimate  of  /<[fc-ij  4-  3/A(fi[k]  —  m[ifc_i]).  Let  n  =  23 


P{CS\Ri)  =  0.69 
PiCSlRi)  =  0.92 
P(CS\Rmedian)  =  0.86 
P(CS\Rmean)  =  0.91 

In  above  examples  we  observe  that  P(C75|i21)  <  P{CS\R2).  These  results  indi¬ 
cate  that  the  nonrandomized  version  of  the  nonparametric  procedure  would  be  better 
when  the  associated  distribution  functions  are  stochastically  increasing  in  the  param¬ 
eters. 

This  can  be  proved  for  k  =  2  and  n  =  1  in  the  location  parameter  case.  Let 
F(.)  be  the  associated  distribution  function,  Xi  be  the  observable  random  variable 
from  the  population  Hi  with  location  parameter  ni  and  X2  be  the  observable  random 
variable  from  population  11  j  with  location  parameter  H2-  Let  g  be  the  distribution 
function  of  A3.  Let  111  be  the  best  population. 

Then 

^((751/?.)  =  )(1 -i,(jr,))  +  is(X,)s(Jf,)+ 1(1 ))(!-, (AT,))] 
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=  E[s(X,)  -  aiX.MX,)  +  p(XMx,)] 

+E[\  -  i(«(A:.)  +  g(X,))  +  \g(X^)g^X^)] 

=  iEWX,)-j(;f,)  +  l]. 

Let  Zi  and  Z2  be  the  independent  random  variables  with  common  distribution  func¬ 
tion  F{.).  Set  Z  =  Z\  —  Zj.  We  have  the  following; 

P{CS\R,)  =  l[P(^Z>-tir  +  ti2)-P{Z>Q)  +  l] 

=  ^Pi—f^i  +  fi2  <  Z  <Q)  +  \] 

Since  P{Z  >  0)  =  P{Z  <  0)  =  1/2,  we  have, 

=  +  /i2  <  Z  <  0)  +  P(Z  >  0). 

It  is  straightforward  to  see  that 

P{CS\R2)  =  P(Z>-mi+M- 

Hence 

P{CS\R2)  >  P{CS\Rx) 

if  and  only  if 

P{Z  >  —fii  -f-  (i2)  >  —P{—fii  -j-  fi2  <  Z  <  0)  +  P{Z  >  0) 
which  is  true  if  and  only  if 

P{Z  >  —fix  -|-  fi2)  —  P{Z  >  0)  =  P{—fii  +  fl2  <  Z  <0)  >  -P{—fli  +  fi2  <  Z  <  0), 
which  is  always  true. 
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